-
Notifications
You must be signed in to change notification settings - Fork 537
support input_pos > 0 for prefill model #8127
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Summary: test input_pos>0 for prefill, not intention for landing but for sync with qc Differential Revision: D68847677
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/8127
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New Failures, 2 Unrelated FailuresAs of commit 38d22fb with merge base 92e7dbd ( NEW FAILURES - The following jobs have failed:
BROKEN TRUNK - The following jobs failed but were present on the merge base:👉 Rebase onto the `viable/strict` branch to avoid these failures
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
This pull request was exported from Phabricator. Differential Revision: D68847677 |
This PR needs a
|
let me explain a little on this, Tokens, freq_cos, freq_sin, mask, k, v caches will be passed in both prefill and decode model |
Summary: test input_pos>0 for prefill, not intention for landing but for sync with qc
Differential Revision: D68847677